STABILITY OF ROOTS OF POLYNOMIALS 
UNDER LINEAR COMBINATIONS OF DERIVATIVES 



BRANKO CURGUS AND VANIA MASCIONI 

Abstract. Let T = a I + aiD + ■•■ + a„D n , where D is the differen- 
tiation operator and ao 7^ 0, and let / be a square-free polynomial with 
large minimum root separation. We prove that the roots of Tf are close 
to the roots of / translated by —cti/ao. 



1. Introduction 

Let n be a positive integer. Denote by V n the (n + l)-dimensional complex 
vector space of all polynomials of degree at most n. By Vq denote the set 
of all constant polynomials. For a non-zero / G V n let 

Z(f) = {w€C: f(w) = 0} 

be the set of the roots of /, where C is the field of complex numbers. It is 
useful to extend the field operations from C onto the nonempty subsets of 
C in the following standard way: for A,5cCwe set 

A + B = {u + v : u G A,v G B) and AB = {uv : u G A,v G S}. 

For r > we set D(r) = {zGC:|2;|<r}. For example, the set Z(f) +B(r) 
is the union of the closed disks of radius r centered at the roots of /. 

Let £(V n ) be the set of all linear operators from V n to V n - How does an 
operator T G C(V n ) perturb the roots of polynomials? To illustrate what 
we mean by this question consider two simple linear operators on V n . Let 
a, t G C, t ^ 0. For / G V n we set 

{S(a)f)(z) :=f(a + z), 

(H(t)f)(z) :=f(z/t). 

Then for all non-constant / G V n we clearly have 

Z(S(a)f)={-a} + Z(f), 

Z{H(t)f)={t}Z(f). 

Hence, for these two classes of operators there is a simple relationship be- 
tween the roots of the original polynomial and the roots of its image. In 
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contrast, for the differentiation operator D : V n — > V n and / £ V n there 
is no simple relation between Z(Df) and Z(f). The classical Gauss-Lucas 
theorem and its many improvements address this question; see for example 
the excellent monograph on the subject [9] and [6] for a recent development. 

In this article we explore the relative position of Z(Tf) in relation to Z{f) 
for those invertible T £ C(V n ) that are linear combinations of /, D, . . . , D n . 
To that end we first define a simple measure of the perturbation of the roots 
under T £ C(V n ). For a non-constant polynomial / £ 7-^ set 

R T (f) := min{r > : Z(Tf) C Z(/) +D(r)}. 

Clearly, for the monomial <j) n {z) ■= z n we have 

RT(4>n) = max{|w| : t> £ Z(T(f) n )}. 

In [5] we proved that the following two statements are equivalent (see 
Theorem 12.61 below) : 

(i) For all non-constant polynomials / £ V n we have 

Rr(f) < Rt(<M. 

(ii) There exist cxq, a\, . . . , a n £ C such that 

(1.1) and T = a$I + a\D H + a n D n . 

Hence, an operator T given by (jl.ip perturbs the roots of cj> n the most, as 
measured by Rt- Since <f> n only has one root of multiplicity n it is plausible 
to surmise that n distinct roots that are far apart from each other will be 
perturbed considerably less by T. But is this correct? 

The following few lines of Mathematica code will help explore this ques- 
tion. In the code below, T stands for a list 

{a ,ai, . . .,a n }, a / 0, 

of the n + 1 coefficients of the operator T in (|l.lj) and W stands for the list 
of the n roots of an / £ V n . First we calculate the roots of Tf and name 
them TW: 

TW := z /. NSolve[ 

T. (D [Times@@(z - W),{z, #}]&/@Range [0, Length [W]] ) == 0,z 
] 

Then we plot the roots W as gray points and the perturbed roots TW as black 
points by the following command: 
Show [Graphics [{ 

{AbsolutePointSize [5] , GrayLevel [0 . 7] , 

Point [{Re [#] , Im [#]}]&/© W>, 
{AbsolutePointSize [5] , Point [{Re [#] , Im [#]}]&/© TW> 
>]] 

Applying these two commands to sets W consisting of distinct points that 
are far apart from each other the first author observed the following surpris- 
ing fact: the points of TW were very close to the points {— ai/ao}+W. That 
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Figure 1: n = 5 and a = 45 Figure 2: A zoom-in on roots 



is, the points of W had been essentially translated by — a\/ao- This kind of 
numerical experiment leads us then to believe that if the roots of / G V n 
are simple and far apart from each other, then the roots of Tf are close to 
the roots of S(ai/ao)f. 

Next we explore this conjecture with three simple examples. Let a > 
(ji!) 1 /™, ip a ,n( z ) '■= zn ~ °™' and T = I + D n . Here a x = 0, so the roots of 
Tip a ,n and ijj a>n should be close for large a. Since (Tip a!n )(z) = z n — (a n — n\), 
if we pair the roots of ip a ,n and Tip a ^ n with the same argument, then their 

moduli differ by a— {a n — n!) 1//n . This quantity indeed tends to as a — > +oo. 
For an even n > 2 one can also consider T = I + iW 2 and arrive at the 
same conclusion. For T = I + D we have (Tijj atn )(z) = z n + nz n ~ l — a n 
and ar/ao = 1- We proceed with re = 2, since the roots of Tip a ^ are easily 
calculable only for this case. Then 

Z{T^ 2 ) = {-1 - \A + a 2 , -1 + \A + a 2 }- 
To test our conjecture in this case, we consider the quantities 



(1.2) -a - (-1- y/l + a 2 ) and a - (-1 + y/T+ 



a 2 



Since both quantities converge to 1 as a — > +oo our conjecture is confirmed 
in this case as well. We offer Figures 1 and 2 as evidence that the same is true 
for n = 5. In the figures the gray points mark the roots of ip45,5( z ) = z 5 — 45 5 , 
the crosses mark the points in {—1} + Z(ip4^^), while the black dots mark 
the roots of (T^ 45i5 )(z) = z 5 + 5z 4 - 45 5 . 



To formulate our conjecture as a formal statement we need to define a 
quantity that will play the role of a in the above simple examples and a 
quantity that will measure how close the roots are. To avoid the discontinu- 
ities caused by collapsed multiple roots, we will only consider polynomials 
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/ in V n with simple roots. Not surprisingly, it turns out that the appropri- 
ate generalization for a is the minimal distance between points of Z(f) and 
points of Z(f'). We denote this quantity by r(f), see Definition 12. II below. 
As a measure how close the roots of two polynomials with the same degree 
are we will use the Frechet distance dp: this distance is obtained by pairing 
the roots of two polynomials to get the smallest possible maximal distance 
between the paired roots, see Definition 12.51 Then our conjecture leads us 
to the following proposition which is one of the main results of this article: 

Proposition. Let T be given by (11. ip . Then for each e > there exists 
Cr(e) > such that for all f G V n with simple roots we have 

r(f) > C T (e) d F {Z{S{ ai /a Q )f),Z{Tf)) < e. 

This proposition is proved as Corollary 14.41 in Section [H In Section [2] 
we collect the necessary definitions and background. Section [3] deals with 
operators T G V n given by (jl.ip with a\ = 0. Here, Theorem 13.11 provides 
the key step towards the proof of the Proposition stated above. In Section d] 
we prove more results involving Frechet distance. We conclude with a few 
examples in Section [5j 

A natural application of our results is towards a better estimate of the 
regions where the roots of polynomials are located. In Corollary 13.31 we il- 
lustrate the possible range of applications by considering the perturbations 
induced by operators of the form T = I + aD, which have been traditionally 
of interest in the study of polynomial roots, see the books [7] and [9] for 
details. This result is a variation on the often quoted classical result by 
Takagi, see [TUl part (VI)] or (9J Corollary 5.4.1 (iii)]. Our result, in some 
cases, gives more precise information than the classical result. This is illus- 
trated in Example 15.31 which also gives a precise explanation of the behavior 
illustrated in Figures 1 and 2 above. 

2. Definitions and preliminaries 

Definition 2.1. Let p be a polynomial of degree n, n > 2, and assume that 
/ has at least two distinct roots. Define 

sep(/) := min{|u; — v\ : w,v G Z(f),w / u} 

and 

r(/):=min{|7i;-i;| : w G Z(f),v G Z(f')\{w}} . 

We will need the following inequality which was established in [31 Theo- 
rem 4]. 

Theorem 2.2. Let f be a polynomial of degree n, n > 2, and assume that 
f has at least two distinct roots. Then 

- sep(/) < r(/) < , 1 sep(/). 
n 2sm(7r/n) 
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Remark 2.3. The quantity sep(/) is known as the minimum root separation 
of a polynomial /. In [3] we denoted it by oj(f). Finding lower estimates for 
sep(/) is important since computing time required by an algorithm to isolate 
the roots of / depends inversely on sep(/), see for example [2j. In this sense 
roots of polynomials with large sep(/) are easy to find. Our results below 
indicate that such roots are also stable under the linear transformations of 
the form ([EE]) . 

Remark 2.4. Most of our results below are formulated in terms of the quan- 
tity t(/). Since we consider n fixed throughout, Theorem 12.21 yields that 
analogous results hold when r(/) is replaced by sep(/). 

Next we define the Frechet distance (see [H Section 3] for details and 
references) between multisets with the same cardinality. 

Definition 2.5. Let m be a positive integer and put M = {l,...,m}. 
By II m we denote the set of all permutations of M. For two functions 
u, v : M — > C we define 

cIf(u,v) := min max \u(k) — v(a(k)) \. 

The adaptation of this definition to cIf(A,B), where both A and B are 
multisets of size n, is then straightforward. 

The following theorem is a combination of Theorems 9.1 and 11.1 from 
[5]. The reader should notice that Rr(f) = dh(Z(f),Z(Tf)), dh being the 
notation used in [5]. 

Theorem 2.6. Let T £ C(V n ). The following statements are equivalent: 

(a) There exist ao,a\, . . . ,a n 6 C such that 

(2.1) T = a I + a l D + --- + a n D n , a / 0. 

(b) For all non-constant polynomials f £ V n we have 

RtU) < Rt{4>u). 

(c) For all f £ V n we have deg(T/) = deg(/) and there exists a constant 
C' T such that d F (Z(f), Z(Tf)) < C' T for all non- constant f £ V n . 

If T £ C(V n ) is given by (|2.ip . then the smallest constant C' T which 
satisfies (jcj) in Theorem 12.61 is denoted by Kp(T). 



3. Perturbations by a special class of operators 

The following theorem is the key result which is used in the rest of the 
article. The main point of interest is that it treats perturbations by operators 
T of the type (12. ip where the D term in the expansion of T is missing. In 
the rest of the article we assume n > 2. 
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Theorem 3.1. Let ax, . . . , a n G C and let T G C(V n ) be given by 
(3.1) T = I + ai D + ■■■ + a n D n . 

Then a± = if and only if there exists a constant Ft > such that for all 
f G V n with simple roots we have 

(3-2) r(f)R T (f)<T T . 

Proof. Assume that T is given by (13. ip and that ct\ = 0. Let / G V n be 
a polynomial with simple roots. We proceed with the construction of IV 
in two steps. First, if t(/) < 2Rx(<j) n ) + 1, then Theorem 12.61 immediately 
yields 

(3.3) r{f)<2R T {(j> n ) + l =► MfW) < M^n) {2R T (<l>n) + l) ■ 
Next, we consider the case 

(3.4) r(/) > 2R T {(j> n ) + 1. 

Our goal is to construct an upper bound for r{f)Rx{f) which does not 
depend on /. By the definition of Rt{$) there exist x\ G Z(f) and v\ G 
Z(Tf) such that Rt(/) = \x± — v\\. In addition, we can choose x\ and v\ 
so that 

(3.5) Rr(f) = |^i — ^i| < | a; — ui| for all x G 2(/). 
The polynomial g and the numbers &o, bi, ■ ■ ■ , b n defined by 

g(z) := (S( Vl )f) ( Z )=b + b 1 z+ b ^z + --- + b ^z n 

will play an important role in the proof. Clearly Z(g) = {— v±} + Z(f). 
Hence, by (|3.5p . w\ = x± — v\ is a root of g with smallest modulus. 

Next we explore the roots of g'. Note that Z(g') = {— v±} + Z(f'). Let 
z G Z(f') be arbitrary. Since the roots of / are simple, z 7^ x\. By the 
definition of r(/), (|3.5p . and (13. 4p we have 

1-2 — ^l| > l-Z — X\\ — \X\ — V\\ 

> r(f) - R T {f) 

> 2R T {cj )n ) + 1 - R T (<f> n ) 
= R T (<p n ) + 1. 

Thus, 

(3.6) |u| > «T(<^n) + 1 > for all u G Z(g'). 
In particular, 

61 = g'(0) + 0. 

Now recall that v\ G Z(Tf), oi\ = 0, and observe that S{v{) and T 
commute, to deduce 

n 

(3.7) = (Tf)( Vl ) = (S( Vl )Tf)(0) = (Tg)(0) = b + Y, a kb k - 

k=2 
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Since w\ is a root of g with smallest modulus, Viete's formulas for g imply 

|&i||wt| < ra|&o|- 
Together with (|3.7p this yields 



(3.8) 



\Wl\ 



- I^tIZ KIN- 



k=2 



Next we consider the polynomial 



h(z) := z n - x g'{\jz) = b x z n ~ x + b 2 z n ~ 2 + ...+ 



Jn-l 



■Z + 



(n-2)! (n-1)! 

We recall that b\ 7^ and we notice that for k = 2, . . . , n, the number 
|&fc|/(l&i|(& ~~ 1)0 is the modulus of the coefficient of z n ~ k in the monic 
polynomial h/b\. Further, let u\ be a root of g' with minimal modulus. 
Then the largest modulus among the roots of h is l/|ui| and, again by 
Viete's formulas, we have 



(3.9) 



b k 



h(k-l)\ 



< 



n — 1 



k — 1/ \ui 



ifc-i 



k = 2, . . . , n. 



Since the function x 1— > ((l + x) n 1 — lj/x,x > 0, is increasing, (|3.6 
implies 



(3.10) \ui\ 1 + 



n-l 



1 < (I2r(0n) + 1) 



ff R T (<M + 2 
ARtM + 1 



n-l 



1 • 



Next we use ()3.8|) . (|3.9p and f|3. lOj) to establish an upper estimate for 
\uiWi\: 



\uiW\\ < \u\\n 



\ak\\h\ 



k=2 



< \u±\ n ( max Im-lfA; — 1)! > 

\ — — / / — o 



< \ ui \n(max n \a k \(k-iy)j2 



k=2 
n-l 



h(k-l) 

n-l\ 1 



k=l 



luiln max \otu\(k — 1)! 

\2<k<n 



1 + 



^ \ n—1 



\ u l\ 



< n ( max \a k \{k - 1)! ) {R T (M + l) 



((R T {<j) n ) + 2 



n-l 



1 • 



^i?T(0n) + l / 

For further reference we denote the last expression by Ty. Hence we proved 

(3.11) l^i^il < r T . 
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Recall that u\ G Z{g') and w\ E Z(g). Since the roots of g are simple, 
w\ 7^ iii and hence 

| it! - wi\ > r(g) = r(/). 
By Theorem E2] and (|33|) 

RtU) < R T (4>n) < r(/)/2. 
Now, the triangle inequality and (13,50 yield 

|t*i] > \ui — w±\ — \w\\ 

> r(f) - R T (4> n ) 

> r(/)/2. 

Since by our choice \u>i\ = \xi — v±\ = Rt(/), (|3.1ip now yields 
(3.12) 7-(/)ii r (/)<2|«i|K|<2r , r . 

Thus, we have proved the implication (|3.4|) => (|3.12p . To conclude the proof 
of the "only if" part of the theorem, set 

T T := max{R T ((t> n )(2R T (<t> n ) + l),2T' T }. 



With this ry the implication (|3.4p (|3.2[) clearly holds. Finally, recall the 
implication (|3.3f) and the "only if" part is proved. 

To prove the "if" part of the theorem, assume that T is given by (|3.1D 
and that (|3.2[) holds for all / £ "P n with simple roots. Let a > be arbitrary 
and as before ip a ,2{z) = z 2 — a 2 . Then a and —a are the roots of ^ a ,2 and 

T~(lpa,2) = a. Also, 

(TiPa i2 )(z) = z 2 + 2a lZ -a 2 + 2a 2 . 
The roots of Tip a 2 are 



-OL\ 



a\ + a 2 — 2a 2 and z a ^ 2 = —ol\ + y af + a 2 — 2a2- 



Since clearly 



lim 
lim 

a— >+oo 

lim 

a— »+oo 

lim 



a 2 + a 2 — 2a2 



ai + v/a 2 + a — 2«2 



a? + a 2 — 2a; 



we conclude 



""I + + a2 ~ 2a 2 

-ai 



lim R T (ip a ,2) = I «i| 

a— >+oo 



|«l|, 

+OO, 
+OO, 



Since by (|3.2p for all a > we have Rri^afi) < ^V/a, letting a — > +00 leads 
to |ai| = 0. This completes the proof. □ 
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Corollary 3.2. Let a.2, . . . ,a n G C. Let T G C(V n ) be given by 

T = I + a 2 D 2 + ■■■ + a n D n . 

Let f G V n be a polynomial with simple roots such that r(f) > 2Rx{(j)n) + 1- 
Then 

Z(Tf)cZ(f)+B{T' T /r(f)), 
with Y' T as defined in the sentence preceding (13. 111) . 

Proof. The corollary is in fact a restatement of the implication (|3.4p =^ (|3.12p 
which is proved as a part of the proof of Theorem 13.11 □ 

The following corollary is inspired by [91 Corollary 5.4.1(iii)]. 

Corollary 3.3. Let a G C and let T G C(V n ) be given by T = L + aD. 
Let f G V n , n > 2, 6e a polynomial with simple roots such that r(/) > 
2|a|(ra - 1) + 1. TTien 

(3.13) Z(Tf) C {-a} + Z(f) + D( 7q /t(/)) , 

where -y a is given by 

2 "(H(»- 1) + ^R^Flf " 1 ) n "{ |a| *( 1 " ^' fc = V "4 

Proof. Corollary 13.21 does not apply to the operator T G C(V n ). Therefore 
we consider the composition V = S(—a)T G jC(V n ) where S(—a) G £(V n ) 
is defined by (S(—a)f)(z) = f(z — a). The Taylor formula at z implies 
that S(— a) = s ^Jk = Q{—oi) k D k /k\. Hence S(— a) and T commute. Since 
D G C(V n ) is the differentiation operator on V n we have D n+1 = 0. These 
observations lead to the following expression for V as a linear combination 
of derivatives: 

n ( \k 
V = (I + aD)j: t % L D* 

k=0 

E \- a ) D k _ \- a ) D k+i 

k=0 k=0 



n 



k=l 

= / D 2 H IT h ^ — - — w — D n . 

2 3 (n-2)!n 

Hence, Corollary 13.21 applies to V. To show that T' v = 7 Q we first calculate 
(T(j) n )(z) = z n + cmz"" 1 and deduce Z(T(p n ) = {0, —an}. Thus, Rrifpn) = 
\a\n. Also, 

Z(y<f> n ) = Z(S(-a)T(f) n ) = {a} + {0, -an} = {a, a(l - n)}, 
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and therefore, Rv(4>n) = \a\(n — 1). As we calculated the coefficients of V 
to be 

(-l) k+1 a k 
ak= (ifc-2)!fc' k = 2 -'-^ 

we have -f a = T' v . 

Since we assume that r(/) > 2|a|(n— 1) + 1 = 2Rv(4>n) + 1, Corollary 13.21 
yields 

Z(Vf)cZ(f)+B( 7a /r(f)). 
To obtain the inclusion in the corollary, substitute / with S(a)f and notice 
that r(S(a)f) = r(f), VS{a)f = Tf and Z{S(a)f) = {-a} + Z(f). □ 

Remark 3.4. In our notation the inclusion proved in [9J Corollary 5.4.1 (hi)] 
reads 

(3.14) Z{Tf) C {-an/2} + Z{f) +B(\a\n/2). 

Notice that if n > 1 and 

7a 



r(f) >max<^2|a|(n-l) + l, 

then the intersection of the right-hand sides of (|3.13p and (|3.14p provides 
improved information about the location of the roots of Tf. Moreover, if 

r(f) > max I 2\a\(n - 1) + 1, ^ 

I M 

then ()3. 13f) is an improvement of (13. 14h , the improvement being considerable 
for large r(/). 

Example 3.5. In this example we give a hint of the problems that can arise 
if we consider polynomials with multiple roots. For a > set 

g a (z) = z 2 (z - a) 2 . 

Let 

T = I + D 2 . 

Since g' a (z) = 2z{z — a)(2z — a), we have r(g a ) = a/2. On the other hand, 
we have 

(Tg a )(z) = g a (z) + g"(z) 
= z 



4 2az 3 + (o 2 + 12) z 2 - Ylaz + 2a 2 



a 2 + ( 12-^-] (z-^Y-t 



16 V 2 ) \ 2) 

Thus the roots of Tg a are symmetric with respect to the real axis and to 
the vertical line Re(z) = a/2. So, it is sufficient to calculate one root of Tg a 
with a positive imaginary part and real part less than a/2. For a > 3\/2 
such a root is given by 



Za,l 



V" 2 -24- W2a 2 — 36^ 
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and the other three roots are 

Za,2 = Z^l, Za,3 = Z a ,l + O, ^a,4 = + «■ 

By rationalizing and then simplifying the expression for z a ^\ it is not difficult 
to prove that 

lim z a i = v2i. 

Consequently, 

lim (z a>2 - (-V2i)) = 0, 

a— >+oo 

lim (za,3- (a + >/2i)) = 0, 

a— »+oo 

lim (z a 4 — (a — V2i)) = 0. 

a^+oo ' ' 

Since Z(g a ) = {0,a}, the last four equalities imply 

lim R T (g a ) = V2. 



a— »+oo 



As t(<7<j) = a/2, the function a i— > T(g a )RT(g a ), a > 0, is unbounded. Hence, 
the assumption of the simplicity of the roots in Theorem 13.11 cannot be 
dropped. 

4. Results involving the Frechet distance 

In the previous section we used Rt(/) as a measure of the distance be- 
tween the roots of / and the roots of Tf. In this section we work with the 
Frechet distance between the roots of polynomials / and Tf. Here Z(f) and 
Z(Tf) are considered as multisets of roots. The Frechet distance is defined 
only for nonconstant polynomials with equal degrees. In each of the cases 
below, the fact that the degrees of / and Tf are equal follows from The- 
orem 12.61 In particular, S(a) does not change the degree of a polynomial, 
and thus all the Frechet distances used below are well defined. Recall that 
the number Kp{T) is defined immediately after Theorem 12 .61 The following 
lemma provides a connection with the results from Section 02 

Lemma 4.1. Let T £ C{V n ) be given by 

(4.1) T = oeol + ct\D + • • • + a n D n , ao, ct\, . . . , a n € C, ao ^ 0. 
Let f 6 V n be a polynomial with simple roots such that 

(4.2) R T (f)<l and r(/) > 1 + f ^ ■ 

sm(7r/nj 

Then 



(4.3) 



d F {Z(f),Z(Tf))=R T (f). 
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Proof. Let / G V n be a polynomial with simple roots satisfying (|4.2|) . Then 
Theorem 12.21 yields 

(4.4) sep(/) > 2 + 2K F {T) > 2. 
Further, by the definition of K F (T), we have 

d F (Z(f),Z(Tf)) <K F (T). 

Now assume that the degree of / is m and let Z(f) = {zi, . . . , z m }, where 
z\, . . . , z m are distinct complex numbers. By the definition of the Frechet dis- 
tance d F (Z(f), Z(Tf)) the roots w±, . . . , w m of Tf (recall that also degTf = 
m) can be indexed in such a way that 

(4.5) \zj -Wj\ < Kp (T) , j = 1, . . . , m. 

Let j and k be distinct numbers from {1, . . . , m}. Then, Zj ^ Zk and the 
triangle inequality, (14. 4h and (|4.5j) yield 

\Wj-W k \ = \(Wj - Zj) + (Zj - z k ) + (z k - w k )\ 

> \Zj - Z k \ - \(Wj - Zj) + (z k - w k )\ 

> \Zj - Z k \ - (\Wj - Zj\ + \z k - w k \) 

> \Zj - Z k \ - \Zj - Wj\ - \z k - w k \ 

> sep(f) - 2K F (T) 

> 2. 

Consequently, 

sep(T/) > 2. 

Therefore, the disks {wj} + B(l) (for j G {1, . . . ,m}) are pairwise disjoint. 
Since Rr(f) < 1 and sep(/) > 2, in each disk {wj} +B(1), j G {1, . . . ,m}, 
there is exactly one root of /. Renumber the roots of / in such a way that 

Zj G { W j} + B(l), je{l,...,m}. 

Then 

d F (Z(f) ) Z(Tf))<max{\w j -z j \:je{l,...,m}}=R T (f). 
Since clearly R T (f) < d F (Z(f), Z(Tf)), g3J follows. □ 
Theorem 4.2. Let n > 2 and let T G £("P n ) be given by 

(4.6) T = I + a 2 D 2 + --- + a n D n , a 2 ,...,a n eC. 

Then for every e > there exists Ct(s) > such that for all f G V n with 
simple roots the following implication holds: 

r{f)>C T {e) d F (Z(f),Z(Tf)) < e. 

Proof. Let T' T > be the constant introduced immediately above inequality 
(I3TTT) . For e > set 



(4.7) 



C T (e) = max/UK^) + 1,I* T) Sl ^ + f^ l . 

[ e sm(7r/n) J 
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Let / £ V n be a polynomial with simple roots such that 

r(/) > C T (e). 
Since r(/) > RT(4> n ) + 1, Corollary 13.21 implies 

(4-8) i? T (/) < 

Since r(/) > Fj., Rrif) < 1- Hence, Lemma I4TT1 yields 

d F (Z(f),Z(Tf))=R T (f). 
Since r(/) > T^/e, (gSJ) implies 

fl T (/) < e. 

The last two displayed relations prove the theorem. □ 

The following proposition is proved by combining the methods of proofs of 
Theorems 13.11 and 14 . 2 [ In the proposition we use Tt defined in Theorem 13. 11 

Proposition 4.3. IfTe C(V n ) is given by (14. 6j) . then for all f G "P„, wii/i 
simple roots we have 

r(f) d F {Z(f),Z(Tf)) < max \t t , K f (T)T t , K^T) 1 + f F } T) ) . 

I sm(7r/nj J 

Proof. Set 

T T := max <^ T T , .... > . 

[ sm(7r/nj J 

Let / E P n be a polynomial with simple roots. As in the proof of Theo- 
rem l3.1l we proceed in two steps. First, if r(/) < r^, then, by the definition 
of Kp(T), 

T(f)d F {Z(f),Z(Tf)) <T' T K F (T) <max{r T ,r^ F (T)} . 
For the second step assume r(/) > T' T . Recall that by Theorem 13.11 
(4.9) T(f)R T (f)<T T <T' T . 

Consequently, R F {f) < 1- Since also r(/) > (l + K F (T)) /(sm(ir/n)), 
Lemma 14.11 implies 

d F (Z(f),Z(Tf))=R T (f). 
Substituting this identity in (|4.9p . we conclude 

r{f) d F (Z(f),Z(Tf)) <T T < max {T T , T' T K F (T)} , 
and the proposition is proved. □ 

Theorem 14.21 is of course a consequence of Proposition 14.31 but Cr(e) 
given by (14. 7p is smaller than the corresponding constant deduced from 
Proposition 14.31 

Now we have all the tools to prove the proposition stated in the Intro- 
duction. 
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Corollary 4.4. Let T £ C(V n ) be given by (14. ip . T/ien /or et>ert/ e > 
there exists Ct(s) > such that for all f £ V n with simple roots we have 

r(f) > C T {e) d F (Z(S( ai /a )f), Z(Tf)) < e. 

Proof. To prove the corollary we observe the identity 

d F {Z(S( ai /a )f),Z(Tf)) =d F {Z(f),Z(S(- ai /ao)Tf)), feV n \V . 

Next we notice that Theorem 14.21 applies to the operator 

T x := —S(- ai /a )T. 
a 

Then the corollary follows from Theorem 14.21 □ 

A simple calculation shows that for t > we have 

r{H(t)f)=tr(f), 
and thus we note the following corollary. 

Corollary 4.5. Let T £ C(V n ) be given by (|4.ip . For an arbitrary f £ V n 
with simple roots we have 

^ x d F (z(S( ai /a®)H(t)f),Z(TH(t)fy) = 

Now we return to a simple motivating example from the Introduction. 
Finding the limits of the expressions in f) 1 . 2 j) is a standard calculus exercise. 
Since the expressions in (|1.2j) represent distances between roots of two qua- 
dratic polynomials, the following corollary can be seen as a generalization 
of this standard calculus exercise. 

Corollary 4.6. Let ai, . . . , a n -\ be arbitrary, but fixed, complex numbers. 
For a G C set 

g a (z)=z n + a n and f a (z) = z n + a n - 1 z n ~ 1 -\ h a\z + a n '. 

Then 

lim dJz(S(a n - 1 /n)g a ),Z(f a )) = 0. 

\a\— >+oo V / 

Proof. Setting 

(n-k)\ 

a = l, a k = j — a„_ fc , k = 1, . . . , n - 1, a n = 0, 

n! 

in (|4.ip we have T<7 a = f a . Since clearly r(g a ) = |a|, the corollary follows 
from Corollary 14.41 □ 

Corollary 14.61 indicates that for a monic polynomial f a with constant term 
a n of large magnitude the approximate location of its roots depends only 
on a, n, and the coefficient a n _i. The following corollary formalizes this 
observation. 
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Corollary 4.7. Let ai, . . . , a n _i be arbitrary, but fixed, complex numbers. 
For a G C set 

/i a (z) = z n + a n _iz n ~ 1 + a n and /„(*) = z n + a n - X z n ~ x + • • • + a x z + a™. 
Then 

lim d F (Z(/i a ),Z(/ a )) =0. 

|a|— >+oo 



Proof. Let ^(-z) = — a n . By Corollary 14.61 

lim dJz(S(a n -i/n)g a ),Z(f a 

|d| — »+oo \ 

and 

lim d F (Z(5(a n _i/n)g a ),Z(/i a )) =0. 

| d| — »+oo \ / 

Since by [U Proposition 3.1], dp is a metric on the space of root sets Z(-), 
d F (Z(h a ),Z(f a )) 

< d F (z(S(a n -i/n)g a ),Z(f a )j + d F \Z{S{a n - 1 /n)g a ),Z(h a ) 
The corollary follows from the last three displayed relations. □ 

5. Estimates and examples 

Example 5.1. A simple way to see our results in action, in particular 
Theorem 13.11 is to look at an example for which we can exactly calculate 
t(/) and Rx(f). In the Introduction we considered T = I + D n . Here 
a\ = 0, so Theorem 13.11 applies. Further, Rt(4>u) = {n\) l / n and, for n > 3, 

. ; \ 71—1 



T T = 2n!((n!) 1 / n + 1) 
A rough estimate yields 



(n!) 1 /" + 1 



n-l 



n-1 



T T < 2(^2 + 1) n! 
Hence, by Theorem 13.11 

(5-1) r(/)iM/)<2(x/2 + l)n!(j) 

for all / £ with simple zeros. Let a > and ipa,n(z) = z n — a n . Then 
T{^ a ,n) = o, and (Tip a ^ n )(z) = z n — (a n — n\). It is not difficult to see that 

R T ^a,n) = \a-{a n -n\) 1/n \, 

where, for < a < (n!) 1 /™, the formula (a n — n!) 1//n denotes the root with 
argument 7r/(2n). Further, elementary considerations yield 

r(^ n )R T (^ n ) = a\a-(a n -nl) 1/n \ < (n\f n . 
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This is certainly much better than (|5.1f) . However, (|5.ip holds for all f G V n 
with simple zeros. 

Remark 5.2. Example 15.11 indicates that the constant in Theorem 13.11 
might not be close to optimal, at all. We note that Theorem 13.11 together 
with Theorem 12.21 yields information about the maximum root separation 
sep(/): 

sep(/) J R T (/) < nT T . 

In pQ , Collins provides us with an impressive amount of numerical evidence 
for what should be some general "ideal" lower bound for sep(/). Still his 
work also highlights how far the current theoretical tools, for example [8], 
are from getting close to the numerical conjectures. The same difficulty 
might also be at work here. 

Example 5.3. Here we apply Corollary 13.31 to the polynomials ip a ,5(z) = 
z 5 — a 5 , a > 0, and the operator T = I + D to provide a precise explanation 
for the behavior of the roots exhibited in Figures 1 and 2. In this case we 
have r(V>a,5) = a and (T^ a ^){z) = z 5 + hz n ~ x - a 5 . By Corollary E31 if 
a > 9 we have 

Z(T^) C {-l} + Z(V>a, 5 )+B(7l/a), 
where 71 = 42.944. In Figures 1 and 2 we used a = 45. Thus 

Rt(^5,s) < li/a = 0.9543li. 

In terms of the geometric objects in Figures 1 and 2 the last inequality 
predicts that the maximum distance between black dots and crosses, paired 
in the natural way, is < 0.954312. In fact Mathematica gives 0.046083 as 
the maximum distance. Our estimate therefore overestimates 0.046083 by 
means of 0.954312, which is not impressive. But it can still be considered 
satisfactory because the upper bound from Corollary 13.31 or more generally 
from Theorem 13. T\ holds under the most general conditions. 

Finally we notice that [9j Corollary 5.4.1 (hi)] in this specific case would 
only give 

Z(2>45, 5 ) C {-5/2} + Z(^45, 5 ) + 0(5/2) , 

which is much less precise information. This is illustrated in Figure 3, where 
the boundaries of the disks in the last inclusion are gray, the smaller disks 
from our estimate are outlined in black, and the roots of 

TV>45,5 = z 5 + 5z 4 - 45 5 

are marked by black dots. In a magnified view in Figure 4 the gray point 
marks 45 exp(67ri/5), while the cross is at 45 exp(67n/5) — 1. 

Acknowledgment. We are grateful to the referees for many valuable sug- 
gestions that led to a much improved paper. 
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Figure 3: A comparison of estimates for the roots of z 5 + 5z 4 — 45 5 
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